Award  Number:  W8  1XWH-07-1-0059 


AD 


TITLE;  Evaluation  of  Fibroblast  Activation  Protein-Alpha  (FAP)  as  a  Diagnostic  Marker 
and  Therapeutic  Target  in  Prostate  Cancer 


PRINCIPAL  INVESTIGATOR:  Nathaniel  Brennen 


CONTRACTING  ORGANIZATION:  Johns  Hopkins  University 

Baltimore,  MD  21218-2686 


REPORT  DATE:  December  2007 


TYPE  OF  REPORT:  Annual 


PREPARED  FOR;  U.S.  Army  Medical  Research  and  Materiel  Command 
Fort  Detrick,  Maryland  21702-5012 


DISTRIBUTION  STATEMENT:  Approved  for  Public  Release; 

Distribution  Unlimited 


The  views,  opinions  and/or  findings  contained  in  this  report  are  those  of  the  author(s)  and 
should  not  be  construed  as  an  official  Department  of  the  Army  position,  policy  or  decision 
unless  so  designated  by  other  documentation. 


REPORT  DOCUMENTATION  PAGE 


Form  Approved 
0MB  No.  0704-0188 


Public  reporting  burden  for  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and  maintaining  the 
data  needed,  and  completing  and  reviewing  this  collection  of  Information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information,  including  suggestions  for  reducing 
this  burden  to  Department  of  Defense,  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports  (0704-0188),  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington,  VA  22202- 
4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  any  penalty  for  failing  to  comply  with  a  collection  of  information  if  it  does  not  display  a  currently 
valid  OMB  control  number.  PLEASE  DO  NOT  RETURN  YOUR  FORM  TO  THE  ABOVE  ADDRESS. 


1 .  REPORT  DATE  (DD-MM-YYYY)  2.  REPORT  TYPE  3.  DATES  COVERED  (From  -  To) 

01-12-2007  Annual  14  MAR  2006  -  28  JUN  2007 


4.  TITLE  AND  SUBTITLE  5a.  CONTRACT  NUMBER 


Evaluation  of  Fibroblast  Activation  Protein-Alpha  (FAP)  as  a  Diagnostic  Marker  and 
Therapeutic  Target  in  Prostate  Cancer 


5b.  GRANT  NUMBER 

W8  1XWH-07-1-0059 


5c.  PROGRAM  ELEMENT  NUMBER 


6.  AUTHOR(S) 

Nathaniel  Brennen 


5d.  PROJECT  NUMBER 


5e.  TASK  NUMBER 


E-Mail:  wbrenne2@jhmi.edut 


5f.  WORK  UNIT  NUMBER 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Johns  Hopkins  University 
Baltimore,  MD  21218-2686 


8.  PERFORMING  ORGANIZATION  REPORT 
NUMBER 


9.  SPONSORING  /  MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

U.S.  Army  Medical  Research  and  Materiel  Command 
Fort  Detrick,  Maryland  21702-5012 


12.  DISTRIBUTION  /  AVAILABILITY  STATEMENT 

Approved  for  Public  Release;  Distribution  Unlimited 


10.  SPONSOR/MONITOR’S  ACRONYM(S) 


11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 


14.  ABSTRACT 

Abstract  provided  on  next  page. 


15.  SUBJECT  TERMS 

Prodrug,  Fibroblast,  Protease 


16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION 

18.  NUMBER 

19a.  NAME  OF  RESPONSIBLE  PERSON 

OF  ABSTRACT 

OF  PAGES 

USAMRMC 

a.  REPORT 

b.  ABSTRACT 

c.  THIS  PAGE 

19b.  TELEPHONE  NUMBER  (include  area 

u 

u 

u 

uu 

21 

code) 

2 


Standard  Form  298  (Rev.  8-98) 
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Technical  Abstract: 


Bacl^round:  It  has  been  well  documented  that  the  tumor  is  dependent  upon  the  reactive  stroma  for  survival 
and  growth  signals,  as  well  as,  the  nutritional  support  necessary  for  the  maintenance  of  the  primary  mass. 
Additionally,  the  ability  of  the  stroma  to  not  only  contribute  to,  but  potentially  drive,  the  progression  of 
cancerous  cells  into  a  highly  aggressive  and  metastatic  phenotype  is  a  concept  that  has  only  recently  begun  to  be 
appreciated.  The  stroma  has  been  shown  to  undergo  morphological  alterations,  recruit  reactive  fibroblasts, 
macrophages,  and  lymphocytes,  increase  secretion  of  growth  factors,  signaling  molecules  and  proteases,  induce 
new  blood  vessel  formation,  as  well  as,  produce  an  altered  extracellular  matrix  when  associated  with  a 
transformed  epithelium.  Fibroblasts,  in  particular,  have  been  shown  to  consistently  undergo  several  changes  in 
both  morphology  and  expression  profiles  when  present  in  the  tumor  microenvironment.  One  defining 
characteristic  of  these  carcinoma-associated  fibroblasts,  or  myofibroblasts,  is  the  expression  of  fibroblast 
activation  protein-alpha  (FAP).  FAP  is  a  membrane-bound  serine  protease  that  has  both  dipeptidase,  as  well  as, 
gelatinase  and  collagenase  activity.  FAP  is  not  expressed  in  healthy  adults,  but  has  been  shown  to  be 
selectively  expressed  on  myofibroblasts  in  the  stroma  surrounding  >90%  of  epithelial  cancers  examined, 
including  7/7  human  prostate  cancer  specimens,  with  minimal  to  no  expression  in  either  cancerous  epithelial  or 
adjacent  normal  tissues.  FAP  has  been  implicated  in  tumor  promotion  through  studies  demonstrating  increases 
in  tumor  incidence,  growth,  and  microvessel  density  using  in  vivo  models.  In  contrast,  other  studies  have 
shown  that  expression  of  FAP  decreased  tumorigenicity  in  vivo  suggesting  that  the  physiologic  response  to  FAP 
may  be  dependent  upon  the  exact  context  of  its  expression. 

Objective/Hypothesis:  Our  goal  is  to  evaluate  FAP  expression  patterns  and  enzymatic  activity  in  both  normal 
prostate  tissue  and  at  various  stages  of  oncogenic  transformation  (i.e.  PIA,  PIN,  Localized  and  Advanced 
Cancer)  to  determine  tumor  stage  in  which  FAP  expression  may  play  a  role.  A  second  objective  is  to  exploit 
this  expression  in  the  treatment  of  prostate  cancer  by  developing  therapies  targeted  for  activation  by  FAP.  This 
will  be  accomplished  by  identifying  selective  peptide  substrates  for  the  proteolytic  activity  of  FAP  and  coupling 
these  peptides  to  a  highly  cytotoxic  agent,  thapsigargin,  to  generate  prodrugs  that  are  only  activated  in  prostate 
tumors  where  FAP  is  expressed. 

Specific  Aims: 

1)  Characterize  the  expression  patterns  of  FAP  in  normal  prostate  tissue  and  in  the  disease  state. 

2)  Identify  a  FAP-specific  cleavable  peptide  sequence  for  the  development  of  targeted  prodrugs. 

3)  Synthesize  prodrug  and  measure  kinetics  in  vitro. 

4)  Determine  the  potential  therapeutic  benefit  and  toxicity  of  this  prodrug  in  vivo. 

Study  Design:  We  will  generate  and  characterize  clones  of  various  prostate  cancer  cell  lines  (LNCaP,  C4-2, 
C4-2B4,  and  PCS)  expressing  FAP  or  an  empty  vector  control.  Primary  human  prostate  cancer  tissue 
specimens  representing  varying  stages  of  prostate  cancer,  as  well  as,  adjacent  normal  tissue  will  be  stained  for 
FAP  expression  and  quantified.  FAP-specific  peptide  substrates  will  be  identified  in  a  complementary  approach 
using  a  solution-phase  phage  display  library  and  a  positional  scanning  synthetic  combinatorial  library.  Optimal 
substrate  sequences  will  be  coupled  to  thapsigargin,  a  highly  cytotoxic  compound.  Kinetics  of  hydrolysis,  as 
well  as,  specificity  and  stability  will  be  measured  in  vitro  using  the  FAP-expressing  clones.  Lead  prodrugs 
identified  in  vitro  will  be  used  to  treat  xenografts  derived  from  FAP-expressing  clones  and  mock-transfected 
controls  to  determine  potential  therapeutic  efficacy  in  vivo. 

Impact:  Metastatic  epithelial  cancers,  such  as  those  of  the  prostate,  are  composed  of  heterogeneous  populations 
of  cells  that  can  have  variable  responses  to  anti-tumor  agents.  Currently  utilized  standard  antiproliferative 
chemotherapies  can  produce  modest  improvement  in  survival  in  select  cancer  types,  but  are  largely  ineffective 
in  treating  malignancies  of  the  prostate  (proliferative  index  <  5%).  Novel  therapies  that  act  in  a  proliferation- 
independent  manner,  therefore,  are  urgently  needed  for  the  treatment  of  advanced  prostate  cancer.  An  emerging 
strategy  has  been  to  target  the  stromal  components  associated  with  the  tumor.  One  potential  therapy  is  the 
development  of  prodrugs  activated  by  proteases  selectively  expressed  in  the  stromal  compartment  as  outlined  in 
this  proposal.  This  approach  will  effectively  deliver  a  cytotoxic  molecule,  such  as  thapsigargin,  in  a  highly 
specific  manner  to  the  tumor  microenvironment  and  will  directly  result  in  significant  stromal  cell  death  and 
indirectly  in  epithelial  mortality  due  to  a  bystander  effect  with  limited  systemic  toxicity. 
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Introduction 


It  has  been  well  documented  that  the  tumor  is  dependent  upon  the  reactive  stroma  for 
survival  and  growth  signals,  as  well  as,  the  nutritional  support  necessary  for  the  maintenance  of 
the  primary  mass.  Additionally,  the  ability  of  the  stroma  to  not  only  contribute  to,  but  potentially 
drive,  the  progression  of  cancerous  cells  into  a  highly  aggressive  and  metastatic  phenotype  is  a 
concept  that  has  only  recently  begun  to  be  appreciated.  The  stroma  has  been  shown  to  undergo 
morphological  alterations,  recruit  reactive  fibroblasts,  macrophages,  and  lymphocytes,  increase 
secretion  of  growth  factors,  signaling  molecules  and  proteases,  induce  new  blood  vessel 
formation,  as  well  as,  produce  an  altered  extracellular  matrix  when  associated  with  a  transformed 
epithelium.  Fibroblasts,  in  particular,  have  been  shown  to  consistently  undergo  several  changes 
in  both  morphology  and  expression  profiles  when  present  in  the  tumor  microenvironment.  One 
defining  characteristic  of  these  carcinoma-associated  fibroblasts,  or  myofibroblasts,  is  the 
expression  of  fibroblast  activation  protein-alpha  (FAP).  FAP  is  a  membrane-bound  serine 
protease  that  has  both  dipeptidase,  as  well  as,  gelatinase  and  collagenase  activity.  FAP  is  not 
expressed  in  healthy  adults,  but  has  been  shown  to  be  selectively  expressed  on  myofibroblasts  in 
the  stroma  surroimding  >90%  of  epithelial  cancers  examined,  including  7/7  human  prostate 
cancer  specimens,  with  minimal  to  no  expression  in  either  cancerous  epithelial  or  adjacent 
normal  tissues.  FAP  has  been  implicated  in  tumor  promotion  through  studies  demonstrating 
increases  in  tumor  incidence,  growth,  and  microvessel  density  using  in  vivo  models.  In  contrast, 
other  studies  have  shown  that  expression  of  FAP  decreased  tumorigenicity  in  vivo  suggesting 
that  the  physiologic  response  to  FAP  may  be  dependent  upon  the  exact  context  of  its  expression. 
Our  goal  is  to  evaluate  FAP  expression  patterns  and  enzymatic  activity  in  both  normal  prostate 
tissue  and  at  various  stages  of  oncogenic  transformation  (i.e.  PIA,  PIN,  Localized  and  Advanced 
Cancer)  to  determine  tumor  stage  in  which  FAP  expression  may  play  a  role.  A  second  objective 
is  to  exploit  this  expression  in  the  treatment  of  prostate  cancer  by  developing  therapies  targeted 
for  activation  by  FAP.  This  will  be  accomplished  by  identifying  selective  peptide  substrates  for 
the  proteolytic  activity  of  FAP  and  coupling  these  peptides  to  a  highly  cytotoxic  agent, 
thapsigargin,  to  generate  prodrugs  that  are  only  activated  in  prostate  tumors  where  FAP  is 
expressed. 
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Body 


Hypothesis  and  Objective 

Our  hypothesis  is  that  FAP  is  expressed  on  the  surface  of  myofibroblasts  in  the  stroma 
surrounding  the  majority  of  malignant  prostatic  lesions  and  that  this  expression  can  be 
taken  advantage  of  for  clinical  benefit.  The  goal  of  this  proposal  is  to  investigate  FAP 
expression  patterns  in  normal  prostate  tissue  and  prostate  cancers  at  various  stages  and  to 
develop  novel  FAP-activated  prodrug  therapies  that  exploit  FAP’s  tumor  stroma  restricted 
expression  in  the  treatment  of  prostate  cancer.  Furthermore,  due  to  the  localized  expression  of 
FAP  to  the  stroma  associated  with  tumors  from  multiple  tissues  of  origin,  this  therapy  may  have 
the  potential  to  effectively  treat  a  broad  spectrum  of  epithelial  malignancies. 


Specific  Aims  1:  Characterize  the  expression  patterns  of  FAP  in  normal  and  hyperplastic 
prostate  tissue  and  in  various  stages  of  prostate  cancer 

Limited  information  is  available  on  expression  of  FAP  in  prostate  tissue.  Initial  studies 
characterizing  FAP  expression  in  epithelial  cancers  did  not  include  prostate  cancer.  The  only 
data  showing  FAP  expression  is  by  Tuxhom  et  al  who  demonstrated  expression  in  7/7  samples  of 
localized  prostate  cancer  taken  from  core  biopsies.  Our  goal,  therefore,  is  to  perform  more 
extensive  analysis  of  FAP  expression  in  normal  prostate,  BPH,  and  various  stages  of  prostate 
cancer  including  premalignant  proliferative  inflammatory  atrophy  (PIA),  prostatic  intraepithelial 
neoplasia  (PIN),  in  addition  to,  various  stages  and  grades  of  PCa,  including  organ-confined, 
androgen-dependent  adenocarcinomas  and  hormone-refractory  metastatic  bone  lesions.  Studies 
to  date  on  human  tissue  have  relied  on  the  commercially  available  F19  mouse  anti-human  FAP 
hybdridoma  which  we  have  available  in  our  laboratory.  A  second  mouse  anti-FAP  antibody  has 
been  purchased  from  Bender  MedSystems.  Both  antibodies  only  recognize  native  FAP  and  only 
work  on  frozen  tissue  and  in  FACS  analysis.  The  goal  of  these  studies  will  be  to  determine  the 
prognostic  significance  of  FAP  expression  in  human  prostate  cancer  and  to  begin  to  gain  an 
understanding  of  whether  FAP  may  play  a  role  in  the  pathobiology  of  the  disease. 

Recently  a  new  anti-human  FAP  antibody  from  Vitatex  that  has  the  ability  to  recognize  both 
dimeric  and  monomeric  forms  of  native  FAP,  as  well  as,  work  in  paraffin-embedded  tissues  has 
become  commercially  available.  This  antibody  has  been  obtained  from  Vitatex  and  is  currently 
being  validated  in  paraffin-embedded  samples.  Once  validation  of  this  antibody  has  been 
completed  we  can  begin  om*  characterization  of  FAP  expression  patterns  in  both  the  normal 
prostate  and  in  various  disease  states.  The  availability  of  an  antibody  that  can  be  utilized  in 
paraffin-embedded  tissues,  as  opposed  to  frozen  sections  only,  will  allow  us  to  perform  more 
extensive  analyses  of  FAP  expression  in  archived  prostate  tissues  and  microarrays  using  greater 
numbers  of  samples  to  increase  the  statistical  significance  of  any  differences  detected  in  the  data. 
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Specific  Aim  2:  Identify  a  FAP-specific  cleavable  peptide  sequence  for  the  development  of 
targeted  prodrugs  delivering  a  toxin  (i.e.,  thapsigargin). 

Peptide  sequences  specifically  cleaved  by  FAP’s  enzymatic  activity  will  be  identified 
through  the  coordinated  application  of  two  different  techniques.  The  first  approach  will  be  to 
perform  traditional  mapping  of  FAP  cleavage  sites  within  collagen  I,  a  known  FAP  protein 
substrate.  Dr.  Denmeade’s  lab  recently  completed  this  digestive  approach  using  recombinant 
collagen  I  derived  gelatin  produced  in  E  coli  so  that  no  post-translational  modifications  occurred 
(i.e.  proline  hydroxylation).  FAP  cleavage  sites  within  this  gelatin  were  determined  using  LC- 
MS  analysis  performed  in  collaboration  with  Dr.  Robert  Cotter’s  group  at  Johns  Hopkins  who 
specialize  in  mass  spectrometric-based  proteomic  techniques.  This  study  yielded  a  large  nximber 
of  cleavage  sites  within  the  gelatin,  but  given  that  collagen  is  a  heterotrimeric  polymer  made  up 
of  repeating  sequences  containing  the  (GXY)n  motif  (X=  Pro,  Y=  HydroxyPro),  the  cleavage 
sites  were  of  a  limited  repertoire.  To  obtain  more  complete  analysis  of  FAP  substrate 
requirements  we  will  synthesize  a  positional  scanning  synthetic  combinatorial  library  (PSSCL) 
employing  a  4-amino-7-carbamoylmethylcoumarin  (ACC)  based  resin  that  will  enable  the  P1-P6 
subsite  specificity  of  the  putative  FAP  substrates  to  be  probed  in  a  high  throughput  manner.  The 
combined  integration  of  consensus  sequences  detected  through  each  of  these  screens  should 
allow  the  identification  of  highly-specific  FAP  cleavable  peptide  sequences  that  can  be  linked  to 
a  highly  cytotoxic  compound  such  as  thapsigargin  to  generate  a  prodrug  that  is  selectively 
activated  by  FAP-positive  reactive  stromal  cells  within  prostate  cancer  sites  while  avoiding 
toxicity  to  normal  FAP-negative  host  tissues. 

The  cleavage  products  generated  from  the  Collagen  I  derived  gelatin  digestion  were  ranked 
according  to  their  normalized  ion  current.  This  value  was  calculated  by  extracting  the  ion  current 
for  the  parent  mass  of  the  ion  from  the  total  MS  ion  chromatogram  and  then  integrating  under  the 
peak  from  the  liquid  chromatogram.  This  ranking  was  performed  in  an  attempt  to  quantify  the 
relative  abxmdance  of  each  fragment  within  the  mixture  and  consequently  a  preferred  cleavage 
motif  (Figure  4  of  appended  publication).  The  frequency  of  individual  amino  acids  within  the 
P7-Pr  positions  of  each  cleavage  site  were  also  determined  to  reveal  a  consensus  peptide 
containing  optimal  residues  in  each  position  The  basic  pattern  that  emerged  from  this  analysis 
was  (D/E)-(R/K)-G-(E/D)-(A/G/S/T)-G-P-A  or  (acidic  aa)-(basic  aa)-G-(acidic  aa)-(Small/Polar- 
OH  aa)-G-P-A  (Figure  5  of  appended  publication). 

As  a  result  of  these  analyses  we  selected  several  peptides  representing  high,  intermediate, 
and  low  extracted  ion  currents  from  the  gelatin  digestion  products,  as  well  as,  variations  of  the 
consensus  peptide  for  further  study.  Fluorescence-quenched  substrates  based  on  the  selected 
sequences  were  s)nithesized  in  order  to  ascertain  Michaelis-Menten  kinetic  parameters  (i.e.  Km, 
Vmax,  kcat)  of  FAP  hydrofysis.  Substrates  were  ranked  according  to  their  calculated  kcat/Km  ratios 
(Table  2  &  3  of  appended  publication).  In  general,  those  substrates  with  the  highest  extracted  ion 
currents  were  cleaved  the  most  efficiently  by  FAP  indicating  that  this  normalization  method  is  of 
benefit  in  identifying  preferred  cleavage  motifs  from  digestion  products.  The  critical  importance 
of  an  amino  acid  in  the  PI’  position  was  demonstrated  based  on  the  near  abrogation  of  hydrolysis 
in  substrates  lacking  a  residue  in  this  position.  There  was  minimal  improvement  observed  in 
substrates  with  additional  amino  acids  on  the  P’  side  beyond  the  PI’  position.  See  appended 


publication,  ‘Fibroblast  Activation  Protein  (FAP)  Peptide  Substrates  Identified  from  Human 
Collagen  I-derived  Gelatin  Cleavage  Sites’  (Brennen,  2007)  for  additional  information  regarding 
methods  and  a  more  detailed  discussion  of  the  results  found  related  to  this  aim.  Synthesis  of 
several  fluorescence-quenched  substrates  containing  hydroxyproline  in  the  Y  positon  of  the 
collagen  (GXY)n  motif  has  been  performed  in  order  to  determine  the  potential  role  of  this  post- 
translational  modification  in  relation  to  FAP’s  efficiency  of  hydrolysis. 

There  has  been  a  complication  that  has  arisen  in  the  synthesis  of  the  4-amino-7- 
carbamoylmethylcoumarin  (ACC)  fluorophore  for  use  in  the  positional  scanning  synthetic 
combinatorial  library  (PSSCL).  The  compound  itself  has  been  successfully  synthesized  as 
determined  by  nuclear  magnetic  resonance  (NMR),  however,  addition  of  the  Fmoe-protecting 
group  to  the  primary  amine  has  proven  difficult.  This  protecting  group  is  necessary  for  the 
directed  coupling  of  the  carboxylic  acid  moiety  to  the  resin  for  solid-phase  synthesis.  Dave 
Meyers,  director  of  the  Pharmacology  Synthesis  Core  on  campus  has  been  contacted  for  aid  in 
the  completion  of  this  final  step  in  the  synthesis. 

Specific  Aim  3:  Synthesize  prodrug  and  measure  kinetics  in  vitro  using  both  human 
prostate  cancer  cell  lines  and  cell-free  assays. 


Our  goal  for  the  second  year  of  funding  will  be  to  evaluate  selective  FAP  peptide  substrates 
identified  in  Aim  2  linked  via  a  peptide  bond  to  amino  acid  containing  12ADT  analogs. 
Synthesis  of  the  prodrugs  will  not  be  initiated  until  further  refinement  of  a  selective  cleavage 
motif  has  been  completed  based  upon  analysis  of  the  PSSCL  as  outlined  in  Specific  Aim  2.  A 
lentiviral  (LV)  vector  (EF.insert.PGK.GFP)  has  been  obtained  from  Dr.  Linzhao  Cheng  here  at 
Johns  Hopkins  for  use  in  the  generation  of  cell  lines  stably  transfected  with  the  FAP  construct. 
The  full-length  FAP  coding  sequence  has  been  ligated  into  the  LV  vector,  and  I  am  currently 
selecting  and  sequencing  clones  of  the  293T  packaging  cells.  The  LV  vector  offers  the 
advantages  of  greater  efficiency  of  integration  into  the  genome,  as  well  as,  ease  of  stable 
transfectant  selection  through  the  use  of  FACS  sorting  based  on  GFP  expression  as  compared  to 
the  pIRESneoS  vector  as  described  in  the  original  proposal. 

Specific  Aim  4:  Determine  the  potential  therapeutic  benefit  and  toxicity  of  this  prodrug  in 
vivo  using  human  prostate  cancer  xenografts  in  immunocompromised  murine  hosts. 

On  the  basis  of  studies  in  Specific  Aim  3,  further  in  vivo  studies  will  be  performed  on 
prodrugs  (“leads”)  that  possess  the  best  combination  of  FAP  hydrolysis,  stability  in  plasma, 
selective  toxicity  to  FAP-positive  prostate  cancer  cells  and  minimal  non-specific  toxicity  (i.e. 
goal  of  20-60  fold  differential).  These  in  vivo  studies  will  not  be  started  until  the  successful 
completion  of  specific  aims  1-3. 
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Key  Research  Accomplishments 


Digestion  and  analysis  of  human  Collagen  I-derived  gelatin  using  a  tandem  LC-MS/MS 
approach 

Thanking  of  digestion  products  according  to  extracted  ion 

Identification  of  preferred  cleavage  motifs  within  the  human  Collagen  I-derived  gelatin 
digestion  products 

Identification  of  a  consensus  peptide  containing  optimal  residues  in  each  position,  P7- 
Pr.  The  basic  pattern  that  emerged  fi'om  this  analysis  was  (D/E)-(R/K)-G-(E/D)- 
(A/G/S/T)-G-P-A  or  (acidic  aa)-(basic  aa)-G-(acidic  aa)-(SmalL^olar-OH  aa)-G-P-A. 
Synthesis  of  several  fluorescence-quenched  peptide  substrates  based  on  sequences 
identified  and  selected  fi'om  the  digestion  products 

Determination  of  the  Michaelis-Menten  kinetic  parameters  (i.e.  Km,  Vmax,  kcat)  for  FAP 
hydrolysis  of  each  of  these  substrates  and  ranking  according  to  their  kcat/Km  ratios. 
Validation  of  normalization  &  ranking  according  to  the  extracted  ion  current 
demonstrating  a  novel  method  for  identifying  preferred  cleavage  sites 
Requirement  for  residue  occup5dng  PI’  position  of  FAP  substrate  and  minimal 
improvement  with  addition  of  >laa  on  the  P’  side. 


Reportable  Outcomes 

Aggarwal  S,  Brennen  WN,  Kole  TP,  Schneider  E,  Topaloglu  O,  Yates  M,  Cotter  RJ, 
Denmeade  SR.  Fibroblast  Activation  Protein  IFAP)  Peptide  Substrates  Identified  fi-om 
Human  Collagen  I-derived  Gelatin  Cleavage  Sites.  Biochemistry,  2007, 
10.1021/bi701921b  [epub  ahead  of  print] 

Brennen  WN,  Aggarwal  S,  Kole  TP,  Yates  M,  Rosen  M,  Denmeade  SR.  Fibroblast 
Activation  Protein  (FAP:  Targeting  the  Reactive  Stroma  as  a  Novel  Prodrug  Therapy. 
Oct  2007. 5^'’  General  Meeting  of  the  International  Proteolysis  Society;  Uni  versity  of 
Patras,  Greece. 

Brennen  WN,  Aggarwal  S,  Kole,  TP,  Topaloglu  O,  Schneider  E,  Becker  R,  Denmeade 
SR.  Fibroblast  Activation  Protein  IFAP:  Targeting  the  Reactive  Stroma  as  a  Novel 
Prodmg  Therapy.  May  2007.  Fellow  Research  Day,  Sidney  Kimmel  Comprehensive 
Cancer  Center  at  Johns  Hopkins;  Baltimore,  MD.  -  Honorable  Mention 


Conclusion 


The  goals  of  this  proposal  are  (1)  to  characterize  the  expression  of  TAP,  a  serine  protease  that 
is  selectively  expressed  by  reactive  tumor  stromal  fibroblasts,  in  human  prostate  cancers  and  (2) 
to  identify  selective  substrates  for  the  unique  proteolytic  activity  of  FAP  that  can  be  incorporated 
into  peptide-TG  prodrugs  that  are  selectively  activated  only  within  FAP-producing  prostate 
cancer  sites.  The  proposal,  therefore,  has  the  potential  to  impact  prostate  cancer  in  two 
significant  ways.  First,  the  proposal  may  identify  a  role  for  FAP  as  a  prognostic  or,  potentially, 
even  diagnostic  marker.  Second,  the  proposal  describes  a  strategy  to  generate  a  novel  prostate 
cancer  targeted  prodrug  therapy  that  could  selectively  kill  prostate  cancer  cells  while  having 
minimal  side  effects  to  normal  tissue.  We  have  thus  far  identified  consensus  motifs  from  FAP 
digested  human  collagen  I-derived  gelatin  and  ranked  them  according  to  their  Michaelis-Menten 
kinetic  parameters  based  on  the  hydrolysis  of  their  respective  fluorescence-quenched  peptide 
substrates.  Several  promising  candidate  sequences  have  been  identified  and  upon  further 
refinement  using  a  PSSCL  approach  will  be  used  to  generate  potential  FAP-selective  prodrugs. 

References 

See  Attached  Appendix 
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Fibroblast  Activation  Protein  Peptide  Substrates  Identified  from  Human  Collagen  I 

Derived  Gelatin  Cleavage  Sites^ 

Saurabh  Aggarwal,* *’^  W.  Nathaniel  Brennen,"  Thomas  P.  Kole,"  Elizabeth  Schneider,*  Ozlem  Topaloglu,* 
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Department  of  Pharmacology  and  Molecular  Sciences,  The  Johns  Hopkins  University,  Baltimore  Maryland  21231 
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abstract:  a  highly  consistent  trait  of  tumor  stromal  fibroblasts  is  the  induction  of  the  membrane-bound 
serine  protease  fibroblast  activation  protein-a  (FAP),  which  is  overexpressed  on  the  surface  of  reactive 
stromal  fibroblasts  present  within  the  stroma  of  the  majority  of  human  epithelial  tumors.  In  contrast,  FAP 
is  not  expressed  by  tumor  epithelial  cells  or  by  fibroblasts  or  other  cell  types  in  normal  tissues.  The 
proteolytic  activity  of  FAP,  therefore,  represents  a  potential  pan-tumor  target  that  can  be  exploited  for  the 
release  of  potent  cytotoxins  from  inactive  prodrugs  consisting  of  an  FAP  peptide  substrate  coupled  to  a 
cytotoxin.  To  identify  FAP  peptide  substrates,  we  used  liquid  chromatography  tandem  mass  spectroscopy 
based  sequencing  to  generate  a  complete  map  of  the  FAP  cleavage  sites  within  human  collagen  I  derived 
gelatin.  Positional  analysis  of  the  frequency  of  each  amino  acid  at  each  position  within  the  cleavage  sites 
revealed  FAP  consensus  sequences  PPGP  and  (D/E)-(R/K)-G-(E/D)-(T/S)-G-P.  These  studies  further 
demonstrated  that  ranking  cleavage  sites  based  on  the  magnitude  of  the  FC/MS/MS  extracted  ion  current 
predicted  FAP  substrates  that  were  cleaved  with  highest  efficiency.  Fluorescence-quenched  peptides  were 
synthesized  on  the  basis  of  the  cleavage  sites  with  the  highest  ion  current  rankings,  and  kinetic  parameters 
for  FAP  hydrolysis  were  determined.  The  substrate  DRGETGP,  which  corresponded  to  the  consensus 
sequence,  had  the  lowest  A'm  of  21  /^M.  Overall  the  values  were  relatively  similar  for  both  high  and 
low  ranked  substrates,  whereas  the  /ccat  values  differed  by  up  to  100-fold.  On  the  basis  of  these  results,  the 
FAP  consensus  sequences  are  currently  being  evaluated  as  FAP-selective  peptide  carriers  for  incorporation 
into  FAP-activated  prodrugs. 


The  growth  of  epithelial  neoplasms  requires  the  formation 
of  a  supporting  tumor  stroma  to  supply  nutrients  and  growth 
factors  for  tumor  cell  survival  and  continued  growth.  This 
invasive  growth  is  associated  with  characteristic  changes  in 
the  supporting  stroma  that  include  induction  of  tumor  blood 
vessel  formation,  the  recraitment  of  reactive  stromal  myo¬ 
fibroblasts,  lymphocytes,  and  macrophages,  the  release  of 
peptide  signaling  molecules  and  proteases,  and  the  production 
of  an  altered  extracellular  matrix  (7—5).  The  tumor  stroma 
compartment  represents  a  major  component  of  the  mass  of 
most  carcinomas,  with  20—50%  commonly  seen  in  breast, 
lung,  and  colorectal  cancers  and  reaching  >90%  in  carci¬ 
nomas  that  have  desmoplastic  reactions  such  as  breast  and 
pancreatic  cancers  (5,  6). 


^  This  work  supported  by  funding  from  the  Susan  G.  Komen  Breast 
Cancer  Foundation  (to  S.R.D.),  the  Department  of  Defense  Breast 
Cancer  Research  Program  (Grant  DAMD17-03-1-0304)  (to  S.R.D.), 
and  a  Department  of  Defense  Prostate  Cancer  Research  Program  Pre- 
Doctoral  Award  (to  W.N.B.).  The  Mass  Spectrometry/Proteomics 
Facility  at  the  Johns  Hopkins  University  School  of  Medicine  is 
supported  by  NCRR  Grant  1S10-RR14702,  the  Johns  Hopkins  Fund 
for  Medical  Discovery,  and  the  Institute  for  Cell. 

*  To  whom  correspondence  should  be  addressed.  Phone:  (410)  502- 
3941.  Fax:  (410)  614-8397.  E-mail:denmesa@jhmi.edu. 

*  The  Sidney  Kimmel  Comprehensive  Cancer  Center  at  Johns 
Hopkins. 

^  Chemical  and  Biomolecular  Engineering  Department. 

"  Department  of  Pharmacology  and  Molecular  Sciences. 


Unlike  malignant  epithelial  cells,  activated  tumor  stromal 
fibroblasts  are  not  transformed  genetically  and  do  not 
demonstrate  the  genetic  and  phenotypic  heterogeneity  seen 
in  malignant  cells.  Reactive  tumor  stromal  fibroblasts  differ 
from  fibroblasts  of  normal  adult  tissues  in  regard  to 
morphology,  gene  expression  profiles,  and  production  of 
important  biological  mediators  such  as  growth  factors  and 
proteases  (7,  6,  7).  For  example,  a  highly  consistent  trait  of 
tumor  stromal  fibroblasts  is  the  induction  of  fibroblast 
activation  protein-a  (FAP).'  FAP  was  originally  identified 
as  an  inducible  antigen  expressed  on  reactive  stroma  and 
given  the  name  “fibroblast  activation  protein”.  FAP  was 
independently  identified  by  a  second  group  as  a  gelatinase 
expressed  by  aggressive  melanoma  cell  lines  and  was  given 
the  name  “seprase”  for  surface-expressed  protease  (8). 
Subsequent  cloning  of  FAP  and  seprase  revealed  that  they 
are  the  same  cell-surface  serine  protease. 

FAP  was  originally  reported  to  be  a  cell-surface  antigen 
recognized  on  human  astrocytes  and  sarcoma  cell  lines  in 
vitro  by  the  FI 9  monoclonal  antibody  (MAh)  (9).  In  one 
series  using  human  tissues,  FAP  was  detected  in  the  stroma 
of  over  90%  of  malignant  breast,  colorectal,  skin,  and 


'  Abbreviations:  FAP,  fibroblast  activation  protein;  LC/MS/MS, 
liquid  chromatography/tandem  mass  spectrometry;  MALDI-TOF,  matrix- 
assisted  laser  desorption/ionization  time  of  flight;  Mca,  7-methoxy- 
coumarin-4-acetic  acid;  Dnp,  dinitrophenyl. 
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pancreatic  tumors  (7,  10).  In  contrast,  most  normal  adult 
tissues  have  demonstrated  no  detectable  FAP  protein  expres¬ 
sion  (7).  FAP  expression  has  been  most  characterized  in 
breast  tissue.  Garin-Chesa  et  al.  used  the  F19  MAb  to 
demonstrate  strong  (12/14)  and  moderate  (2/14)  expression 
of  FAP  in  the  stroma  of  human  breast  carcinomas  but 
observed  no  expression  by  breast  cancer  epithelial  cells  and 
no  expression  in  adjacent  normal  breast  tissue.  Additionally, 
little  to  no  expression  was  observed  in  the  stroma  or  epithelial 
cells  of  10/10  samples  of  fibrocystic  disease  and  2/2  samples 
of  fibroadenomas  (7). 

FAP  is  a  member  of  the  enzyme  class  known  as  post- 
prolyl  peptidases  that  are  uniquely  capable  of  cleaving  the 
Pro— X  amino  acid  bond.  These  enzymes  have  been  dem¬ 
onstrated  to  play  a  role  in  cancer  biology  and  are  capable  of 
modifying  bioactive  peptides  (77).  This  group  of  proteases 
includes  the  well-characterized  dipeptidyl  peptidase  IV 
(DPPIV)  as  well  as  DPPII,  DPP6,  DPP7,  DPP8,  DPP9,  prolyl 
carboxypeptidase,  and  prolyl  endopeptidase  (77).  The  sub¬ 
strate  preferences  for  many  of  these  prolyl  peptidases  are 
not  entirely  known,  but  like  DPPIV,  they  all  have  dipeptidase 
functionality.  FAP  is  highly  homologous  to  DPPIV  (77).  Like 
DPPIV,  FAP  is  a  type  II  integral  membrane  protein  that  is 
able  to  cleave  peptides  with  proline  as  the  penultimate  amino 
acid  (72).  However,  FAP  differs  from  DPPIV  in  that  it  also 
has  gelatinase  and,  possibly,  collagenase  activity  {8,  11). 
Using  zymography,  FAP  was  demonstrated  to  cleave  gelatin 
and  human  collagen  I  but  was  unable  to  cleave  human 
fibronectin,  laminin,  or  collagen  IV  (72).  These  results 
suggest  that  FAP’s  physiologic  function  may  be  primarily 
that  of  an  endopeptidase  that  can  degrade  proteins  rather  than 
a  dipeptidase  like  DPPIV.  This  additional  gelatinase/colla- 
genase  activity  may  be  unique  to  FAP  among  the  family  of 
prolyl  proteases.  Unlike  DPPIV,  FAP  is  also  not  widely 
expressed  in  most  normal  tissues  (77). 

Our  laboratory  has  been  engaged  in  the  development  of 
prodrugs  that  can  be  selectively  activated  by  tissue-specific 
proteases  (13—17).  These  prodrugs  are  produced  by  coupling 
a  cytotoxic  agent  to  a  peptide  carrier  to  produce  an  inactive 
compound  that  can  only  become  activated  upon  release  of 
the  cytotoxin  from  the  peptide  by  proteolysis.  Our  initial 
efforts  have  focused  on  the  development  of  prodrugs 
activated  by  the  prostate  cancer  serine  proteases  prostate- 
specific  antigen  (PSA)  and  human  glandular  kallikrein  2 
(hK2)  that  are  both  members  of  the  kallikrein  family  (75— 
16).  In  contrast  to  this  tissue-specific  approach,  FAP 
represents  a  potential  pan-tumor  target.  To  develop  FAP- 
activated  prodrugs  requires  the  identification  of  a  peptide 
substrate  that  is  selectively  hydrolyzed  by  FAP.  In  the  current 
study,  we  present  results  using  liquid  chromatography/tandem 
mass  spectrometry  (LC/MS/MS)  to  generate  a  complete  map 
of  FAP  cleavage  sites  within  recombinant  forms  of  human 
collagen  I  derived  gelatin.  We  have  synthesized  selected 
peptides  on  the  basis  of  these  cleavage  maps  and  analyzed 
them  for  hydrolysis  by  FAP  to  identify  peptides  that  could 
be  used  to  target  cytotoxins  to  FAP-expressing  tumor  tissue. 

EXPERIMENTAL  PROCEDURES 

Materials.  The  Drosophila  Expression  System  (DES)  was 
from  Invitrogen  (Rockville,  MD).  Peptide  Ala-Pro-AEC 
[AEC  =  7-amino-4-(trifluoromethyl)coumarin]  was  from 


Bachem  (Heidelberg,  Germany).  Gly-Pro-AMC,  the  MMP 
substrate  sampler  kit,  and  all  other  peptide  synthesis  reagents 
were  from  Anaspec  (San  Jose,  CA).  Novatag  Dnp  resin, 
A-(7-methoxycoumarin-4-acetyloxy)succinimide  (Mca-Osu), 
1 -hydroxybenzotriazole  (HOBt),  and /V-methyl-2-pyrrolidone 
(NMP)  were  from  Novabiochem,  San  Diego,  CA.  Unless 
otherwise  indicated  all  the  other  reagents  were  from  Sigma- 
Aldrich  (St.  Louis,  MO). 

FAP  Cloning  and  Expression.  A  PCR  approach  was  used 
to  amplify  and  attach  a  Hise  tag  to  the  amino  terminus  of 
the  extracellular  domain  of  PAP  (Genbank  accession  number 
NM_004460).  The  primers  used  were  (forward  BgBY) 
5'-GGAAGATCrCATCATCACCATCACCATCGCCCT- 
TCAAG-3'  and  (reverse  Xhol)  5'-GGCCTCGAGTCATT- 
AGTCTGACAAAGAGAAACACTGC-3'.  Template  ampli¬ 
fication  was  performed  using  Pfu  polymerase  (Promega, 
Madison)  as  per  the  suggested  protocol.  A  PCR  reaction 
began  with  an  initial  denaturation  step  (94  °C  for  2  min) 
followed  by  three  cycles  of  amplification  (94  °C  for  30  s, 
40  °C  for  1  min,  72  °C  for  2  min),  followed  by  30  cycles  of 
amplification  (94  °C  for  30  s,  58  °C  for  1  min,  72  °C  for  2 
min),  and  ended  with  a  final  extension  step  (72  °C  for  10 
min).  A  2  kb  PCR  fragment  was  purified  by  gel  electro¬ 
phoresis,  digested  with  BglWIXhol,  and  cloned  into  pMT/ 
BiP/V5— HisA  (Invitrogen,  California)  previously  digested 
with  the  same  set  of  enzymes.  The  final  construct  was 
designated  as  pMT— His— PAP. 

Transfection  of  Insect  Cells  and  Stable  Cell  Line  Genera¬ 
tion.  Schneider’s  S2  cells  (Invitrogen)  were  maintained  in 
DES  medium  (Gibco,  Rockville,  MD)  supplemented  with 
10%  heat-inactivated  fetal  bovine  serum  (PBS)  at  room 
temperature.  Before  transfection,  the  cells  were  seeded  in  a 
35  mm  dish  and  grown  until  they  reached  a  density  of  (2— 
4)  X  10®  cells/mL.  The  cells  were  cotransfected  with  19  /rg 
of  pMT— His— PAP  and  1  //g  of  a  pCoHYGRO  selection 
vector  using  a  kit  for  calcium  phosphate-mediated  transfec¬ 
tion  (Invitrogen).  The  calcium  phosphate  solution  was 
removed  16  h  post-transfection,  and  fresh  DES  medium 
supplemented  with  10%  PBS  was  added  (a  complete 
medium).  The  cells  were  grown  for  an  additional  2  days, 
and  then  the  medium  was  replaced  with  the  complete 
medium  containing  400  /rg/mP  hygromycin  B  (Invitrogen). 
The  selection  medium  was  changed  every  3—4  days. 
Extensive  cell  death  of  nontransfected  cells  was  evident  after 
about  1  week,  and  cells  resistant  to  hygromycin  B  started  to 
grow  out  2—3  weeks  post-transfection. 

His-Tagged  FAP  Large-Scale  Expression  and  Purification. 
The  hygromycin-resistant  cells  were  seeded  in  10  T-150’s 
at  a  density  of  1  million  cells/mL.  When  the  cells  reached  a 
density  of  2—3  million  cells/mL,  500  pM  CUSO4  was  added 
to  induce  PAP  expression.  The  cells  were  grown  until  they 
reached  a  density  of  10—15  million  cells/mL  (8—9  days).  A 
2  mP  portion  of  200  mm  L-glutamine  was  added  to  the  cell 
suspension  on  days  2  and  6.  Conditioned  medium  containing 
secreted  PAP  was  collected  after  12—14  days.  The  medium 
was  concentrated,  and  excess  CUSO4  was  removed  by  three 
rounds  of  ultrafiltration  using  an  Amicon  8480  membrane 
(Millipore)  with  a  30  000  kDa  cutoff.  After  each  round  of 
ultrafiltration,  the  volume  was  made  up  using  sterile  water. 
Pinal  purification  was  obtained  by  incubating  the  concentrate 
with  Ni— NTA  resin  (Qiagen,  California)  in  manufacturer- 
recommended  salt  and  imidazole  concentrations.  PAP  was 


Biochemistry 

eluted  from  the  resin  using  250  mM  imidazole.  The  final  30 
mL  of  eluate  was  diluted  with  water  to  300  mL,  and 
imidazole  was  removed  by  two  rounds  of  ultrafiltration.  The 
purity  was  checked  by  SDS— PAGE  and  Coomassie  staining. 
Western  blots  were  probed  with  anti-His  tag  [penta-His— 
horse  radish  peroxidase  (HRP)  conjugate  from  Qiagen). 
Overall,  a  yield  of  1—2  mg  was  obtained  from  a  700  mL 
culture.  Final  purified  aliquots  were  stored  in  reaction  buffer 
at  -20  °C. 

FAP  Enzyme  Activity.  FAP  dipeptidyl  peptidase  activity 
was  determined  by  digesting  500  //M  Ala-Pro-AFC  (Cal- 
biochem)  with  rhFAP  as  described  by  Park  et  al.  (72).  Assays 
were  performed  at  23  °C  in  100  mM  Tris,  100  mM  NaCl  at 
pH  7.8  in  10%  DMSO  and  0.3%  Brij-35.  The  fluorescence 
output  was  monitored  every  30  s  using  a  DTX  880  multi- 
mode  detector  (Beckman  Dickinson).  Standard  curves  of 
AFC  (Calbiochem)  fluorescence  vs  concentration  were  run 
with  each  assay  to  convert  relative  fluorescence  units  to 
moles  of  product  generated.  Excitation/emission  wavelengths 
of  370  and  535  nm,  respectively,  were  used  to  monitor 
liberation  of  the  AFC  fluorophore.  The  rate  of  hydrolysis 
(mol/min)  was  ascertained  by  determining  the  slope  during 
the  first  minute  of  the  reaction.  This  rate  was  then  used  to 
calculate  the  enzyme  units  per  unit  volume  with  1  unit  of 
enzyme  activity  being  defined  as  the  cleavage  of  60  /rmol 
of  substrate/min. 

FAP  Gelatinase  Assay.  Quenched  gelatin  and  collagen 
conjugates  were  used  to  detect  and  confirm  FAP’s  gelatinase 
and  collagenase  activity.  DQ  gelatin  from  pig  skin  and  DQ 
collagen  type  IV  from  human  placenta  fluorescence- 
quenched  conjugates  (Invitrogen,  Rockville,  MD)  were 
digested  with  FAP,  and  digestion  was  monitored  on  a 
fluorescence  plate  reader.  Protein  substrates  were  dissolved 
in  reaction  buffer  (100  mM  NaCl,  100  mM  Tris,  pH  7.8)  to 
a  final  concentration  of  100  fig/mL.  Trypsin  digestion  of  each 
protein  was  used  as  a  positive  control.  As  a  negative  control 
the  His-tagged  extracellular  domain  of  prostate-specific 
membrane  antigen  (PSMA),  which  was  similarly  purified 
from  S2  cells  under  the  same  conditions  as  FAP,  was  also 
incubated  with  the  quenched  proteins.  Fluorescence-quenched 
DQ  bovine  serum  albumin  (BSA)  was  used  as  a  negative 
control  for  FAP  protease  activity. 

Digestion  of  Recombinant  Gelatin  with  FAP  for  Cleavage 
Mapping.  Recombinant  human  gelatins  of  100  and  8.5  kDa 
(Fibrogen,  San  Francisco,  CA)  were  dissolved  in  reaction 
buffer,  and  1  pg  of  FAP  was  added  per  100  pg  of  protein 
substrate.  Digestion  was  done  for  4—6  h  at  37  °C.  As  a 
positive  control,  trypsin  digestion  was  performed.  As  a 
negative  control,  protein  solutions  were  incubated  with  either 
BS A/buffer  or  buffer  alone.  Peptide  fragments  of  size  <30 
kDa  were  purified  using  a  30  kDa  Microcon  spin  filter 
(Millipore,  Billerica,  MA).  The  fragments  were  further 
purified  with  Ci8  spin  tubes  (Agilent,  Palo  Alto,  CA)  as  per 
the  suggested  protocol  with  the  substitution  of  0.5%  aceto¬ 
nitrile  in  place  of  5%  for  binding  and  washing  of  the  Cis 
columns.  Samples  were  prepared  for  matrix-assisted  laser 
desorption/ionization  time  of  flight  (MALDI-TOF)  analysis 
on  an  Applied  Biosystems  Voyager-DE  STR  (mass  accuracy 
>  100  ppm)  by  100:1  dilution  with  2,5-dihydroxybenzoic  acid 
(DHB)  as  the  matrix.  Analysis  was  performed  in  the  linear 
mode  over  a  mass  acquisition  range  of  500—5000  Da  with 
50  laser  shots  performed  per  spectrum.  Instrument  calibration 
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was  performed  routinely  by  the  Mass  Spectrometry  Core 
Facility  and  was  not  performed  prior  to  individual  analyses. 

Nanoflow  FIPLC  and  Mass  Spectrometry.  Peptides  ob¬ 
tained  from  the  FAP/gelatin  digests  were  dried  using  a 
Speed  Vac  (Eppendorf),  resuspended  in  LC/MS  loading 
buffer  (3%  ACN,  0.1%  formic  acid),  and  analyzed  using 
nanoflow  LC/MS/MS  on  an  Agilent  1100  series  nano-LC 
system  (Agilent)  coupled  to  an  LCQ  DUO  ion  trap  mass 
spectrometer  (ThermoFinnigan).  The  LCQ  was  calibrated 
{m/z  50—2000)  using  the  automatic  calibration  procedure 
outlined  by  the  manufacturer.  A  fused  silica  capillary  was 
loaded  with  a  solution  containing  the  Met-Arg-Phe-Ala 
(MRFA)  peptide,  caffeine,  and  Ultramark  1621  in  50% 
acetonitrile/0. 1%  formic  acid.  The  capillary  was  connected 
to  the  nanoflow  HPLC  instrument  and  sprayed  into  the  LCQ 
using  the  same  flow  rate  and  spray  voltage  used  in  all 
subsequent  experiments.  The  calibration  was  tested  using  a 
standard  100  fmol  tryptic  digest  of  BSA  and  demonstrated 
a  mass  accuracy  of  <0.5  Da. 

Peptides  were  preconcentrated  on  a  5  mm  Zorbax  C18 
trap  column  (Agilent)  and  then  eluted  onto  a  100  x  0.075 
mm  custom-packed  Biobasic  C18  (ThermoElectron)  re  versed- 
phase  capillary  column  connected  to  a  laser-pulled  electro¬ 
spray  ionization  emitter  tip  (New  Objective)  at  a  flow  rate 
of  300  nL/min.  Peptides  were  eluted  from  the  nanospray 
source  of  the  LCQ  (Proxeon,  Denmark)  using  the  following 
gradient:  0%  B  at  0  min,  5%  B  at  8  min,  45%  B  at  50  min, 
90%  B  at  55  min,  90%  B  at  60  min  (solvent  B  is  0.1%  formic 
acid  in  acetonitrile)  at  a  spray  voltage  of  2.5  kV.  The  LCQ 
was  operated  in  data-dependent  mode  using  the  Xcalibur 
software  (ThermoFinnigan)  in  which  every  MS  scan  (400— 
1800  miz)  was  followed  by  MS/MS  scans  (400—1800  m/z) 
on  the  three  most  intense  ions  using  an  isolation  window  of 
±1.5  Da.  Ions  selected  for  MS/MS  fragmentation  were 
dynamically  excluded  for  30  s. 

MS/MS  data  were  searched  against  a  collagen  FASTA 
database  using  the  SEQUEST  search  algorithm  built  into  the 
Bioworks  Browser  (ThermoFinnigan),  allowing  for  the 
variable  modification  of  methionine  oxidation.  The  peptides 
were  initially  filtered  in  a  charge-dependent  manner  using 
an  XCorr  filter  of  1.5,  2,  and  2.5  for  singly,  doubly,  and 
triply  charged  peptides.  All  MS/MS  spectra  used  to  identify 
peptides  were  manually  inspected  for  validation  of  the  y  and 
b  ion  series.  To  quantify  the  relative  abundance  of  each 
identified  peptide,  we  compared  the  ion  current  for  each  of 
the  observed  peptide  parent  ions  from  the  MS  spectra.  The 
contribution  of  each  parent  ion  to  the  total  ion  current  was 
extracted  and  integrated  over  the  peptide  elution  peak. 

Synthesis  of  Substrates  Based  on  Determined  Cleavage 
Sites.  Quenched  peptide  substrates  were  prepared  by  using 
the  7-methoxycoumarin-4-acetic  acid  (Mca)/Dnp  fluoro- 
phore/quencher  pair.  Synthesis  of  peptides  was  done  using 
standard  Fmoc  solid-phase  coupling  on  NovaTag  Dnp  resin 
with  a  substitution  level  of  0.4  mmol/g  (Novabiochem,  San 
Diego,  CA).  N-terminal  capping  was  done  twice  overnight 
with  Mca-Osu  and  HOBt  in  NMP.  The  peptides  were  cleaved 
with  95%  TFA,  2.5%  TIS,  and  2.5%  water.  The  purity  and 
mass  of  each  quenched  peptide  was  confirmed  by  reversed- 
phase  HPLC  and  MALDI-TOF  analysis. 

Protease  Assays.  The  substrates  were  dissolved  in  DMSO 
at  a  stock  concentration  of  10  mM.  Working  concentrations 
of  each  substrate  were  made  by  making  2-fold  dilutions 
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Figure  1:  Characterization  of  recombinant  Flis-tagged  FAP.  (A)  Flydrolysis  of  the  dipeptide  substrate  Ala-Pro-AFC  (500  /<M)  by  FAP  (5 
/tg/mL)  compared  to  DPPIV.  The  inset  shows  Western  blot  analysis  of  purified  Flis— FAP  following  denaturing  SDS— PAGE.  (B)  Digestion 
of  indicated  FITC-labeled  proteins  (100  //g/mL)  by  FAP  (5  ^g/mL). 


ranging  from  500  to  7.9  /rM.  rhFAP  (lOE— 4  units,  ~210— 
230  nM  depending  on  the  specific  activity  of  the  enzyme) 
was  added  to  each  assay  where  enzyme  was  included.  The 
fluorescence  of  the  cleavage  product  resulting  from  protease 
activity  was  monitored  every  30  s  using  a  DTX  880 
multimode  detector  (Beckman  Dickinson).  Standard  curves 
of  Mca  (Novabiochem)  fluorescence  vs  concentration  were 
run  with  each  assay  to  convert  relative  fluorescence  units  to 
moles  of  product  generated.  Excitation  and  emission  wave¬ 
lengths  of  340  and  435  nm,  respectively,  were  used  to 
monitor  liberation  of  the  Mca  fluorophore  peptide  cleavage 
fragment.  Assays  were  performed  at  23  °C  in  100  mM  Tris, 
100  mM  NaCl  at  pH  7.8  in  10%  DMSO  and  0.3%  Brij-35. 
Kinetic  constants  (Km  and  kcaO  were  calculated  on  the  basis 
of  the  rate  of  hydrolysis  {v)  during  the  first  minute  of  the 
reaction.  Kinetic  parameters  were  calculated  from  Michae- 
lis— Menten  plots  {v  vs  [S])  with  nonlinear  regression  analysis 
using  SigmaPlot  software. 

RESULTS 

Purification  of  Enzymatically  Active  FAP.  The  character¬ 
ization  of  protease  substrate  specificity  requires  that  the 
protease  be  of  maximum  purity  and  correctly  folded  to 
maintain  enzymatic  activity.  Previously  it  had  been  shown 
that  full-length  PAP,  cloned  and  expressed  in  Drosophila 
S2  cells,  yielded  highly  pure  protein  that  was  enzymatically 
similar  to  the  human  form  (18).  Therefore,  the  extracellular 
domain  of  PAP  was  cloned  with  a  Hise  tag  at  its  N-terminus 
to  generate  a  stable  PAP-producing  Drosophila  S2  cell  line. 
On  induction  with  CUSO4,  PAP  was  secreted  into  the 
medium,  which  was  then  concentrated  by  ultrafiltration  and 
purified  using  Ni— NTA  beads.  Purified  PAP  was  demon¬ 
strated  to  be  enzymatically  active  via  its  ability  to  cleave 
the  dipeptide  substrate  Ala-Pro-APC  (18)  (Pigure  lA). 
Western  blot  analysis  with  an  anti-His  tag  MAb  documented 
the  correct  protein  size  of  ~80  kDa  (Pigure  lA,  inset). 

Recombinant  FAP  Retains  Gelatinase  Activity.  Quenched 
forms  of  gelatin  and  collagen  were  used  to  confirm  the 
gelatinase  and  collagenase  activity  of  recombinant  PAP. 
Quenching  is  achieved  by  heavily  labeling  these  proteins  with 
the  fluorophore  PITC  such  that  the  fluorescence  signal  from 
the  intact  protein  is  minimal  due  to  self-quenching  by  the 
fluorophore.  Protein  digestion  releases  PITC-labeled  frag¬ 
ments  that  result  in  a  measurable  increase  in  overall 


fluorescence  signal  from  the  reaction  mixture.  Previously, 
it  had  been  demonstrated  using  gel  zymography  that  PAP 
can  cleave  gelatin  and  collagen  I  but  could  not  cleave 
collagen  IV  (12).  It  remains  unclear  whether  PAP  can  digest 
collagen  I  directly  as  recent  studies  suggest  that  the  observed 
digestion  may  be  due  to  an  artifactual  conversion  of  collagen 
I  to  gelatin  during  preparation  (72,  79,  20).  To  confirm  that 
our  recombinant  PAP  maintained  the  ability  to  cleave  gelatin, 
we  used  the  PITC-quenched  protein  DQ  gelatin  from  pig 
skin,  with  DQ  collagen  IV  from  human  placenta  serving  as 
a  negative  control.  In  this  assay,  gelatin  was  readily 
hydrolyzed  by  PAP  while  collagen  IV  was  not  hydrolyzed 
(Pigure  IB).  MALDI-TOP  analysis  of  the  digested  fragments 
was  performed  to  confirm  PAP  hydrolysis.  As  a  negative 
control,  we  demonstrated  no  digestion  of  any  of  the  proteins 
using  His-tagged  human  carboxypeptidase  PSMA  purified 
from  Drosophila  S2  cells  under  the  same  conditions  (data 
not  shown).  These  results  confirm  that  gelatin  hydrolysis  was 
due  to  PAP  and  not  due  to  the  presence  of  some  other 
protease  contaminating  our  purification  system. 

MAFDI  for  FAP  Digest  of  Denatured  Human  Collagen  I. 
To  elucidate  the  substrate  specificity  of  PAP,  we  examined 
PAP  digests  of  unlabeled  denatured  human  collagen  I  using 
MALDI-TOP  mass  spectrometry  (Pigure  2A).  Digestion 
reactions  were  performed  at  a  substrate  to  protease  mass  ratio 
of  200;  I  using  recombinant  PAP  or  modified  trypsin  as  a 
control.  Two  negative  controls  of  collagen  alone  and  PAP 
alone  were  also  included  to  identify  any  peptides  due  to 
autolysis/degradation  of  these  proteins.  SDS— PAGE  analysis 
of  PAP-digested  denatured  collagen  I  (not  shown)  produced 
a  smear  of  continuous  size  fragments,  suggesting  the  presence 
of  many  cleavage  sites.  To  simplify  cleavage  mapping  by 
MALDI-TOP,  small  fragments  (<5  kDa)  were  isolated  by 
ultrafiltration  and  further  purified  using  reversed-phase 
chromatography,  Pigure  2A.  MALDI-TOP  was  subsequently 
performed  using  serial  dilutions  of  the  isolated  peptides. 

The  masses  of  singly  charged  ions  [M  +  H]+  obtained 
from  MALDI  spectra  were  entered  into  the  PindPept  search 
tool  at  the  ExPASy  proteomics  server  (http://www.expasy- 
.org/tools/findpept.html)  and  used  to  perform  a  peptide  mass 
fingerprint  (PMP)  search  against  the  known  collagen  se¬ 
quence.  MALDI  spectra  suggest  that  denatured  human 
collagen  I  is  cleaved  by  PAP  at  numerous  specific  sites 
(Pigure  2A);  however,  we  were  unable  to  unambiguously 
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1399.4  2299  S  3300.2  4100.6  6001  0 

Mass  (m/z) 


FindPept 

Calculated 

Position 

Mass 

Peptide  Sequence 

in  FAP 

2113.07  GAAGEPGKAGERGVPGPPGA  VGPAG  55-79 
2113.12  GPKGAAGEPGKAGERGVPGP  PGAV  52-75 
2113.12  PKGAAGEPGKAGERGVPGPP  GAVG  53-76 
2113.12  KGAAGEPGKAGERGVPGPPG  AVGP  54-77 
2113.12  AGEPGKAGERGVPGPPGAVG  PAGK  57-80 
2114.07  GRPGPPGPPGARGQAGVMGF  PGP  31-53 
2115.01  AVGPAGKDGEAGAQGPPGPA  GPAGE  74-98 


Cleavage 

Fragment 

Theoretical 

[M+H|+ 

Charge 

m/z 

(observed) 

PPGAVGP/ 

AGK.AQGPPGP/ 

AGP 

1308.62 

2 

654.81 

GLP...  SPGSPGP/ 

DGK 

1449.77 

2 

725.39 

KTGPPGP/ 

AGQ... PPGPPGA/ 

RGQ 

1330.65 

2 

665.83 

VMGFPGP/ 

KGA.  ..PPGAVGP/ 

AGK 

2113.12 

2 

1057.06 

VMGFPGP/ 

KGA  ...GEPGKAG/ 

ERG 

942.5 

1 

942.50 

GLP.. KTGPPGP/ 

AGQ 

2256.16 

2 

1128.58 

GFPGPKG/ 

AAG  .  PPGAVGP/ 

AGK 

1928 

2 

964.50 

FPGPKGA/ 

AGE. ..PPGAVGP/ 

AGK 

1856.96 

2 

928.98 

LTGSPGS/ 

PGP. ..KTGPPGP/ 

AGQ 

1076.54 

2 

538.77 

KTGPPGP/ 

AGQ. GPPGPPG/ 

ARG 

1259.61 

1 

1259.61 

VMGFPGP/ 

KGA.  ..AGEPGKA/  GER 

885.48 

1 

885.48 

GLPGAKG/ 

LTG... SPGSPGP/ 

DGK 

869.44 

1 

869.44 

MGFPGPK/ 

GA-  AGEPGKA/ 

GER 

757.38 

1 

757.38 

PGPPGAR/ 

GQA- VMGFPGP/  KGA 

1017.48 

1 

1017.48 

PGARCQA/  G-  VMGFPGP/ 

KGA 

761.37 

1 

761.37 

GPPGPPG/ 

ARG, ..VMGFPGP/ 

KGA 

1244.62 

2 

622.81 

PPGPPGA/ 

RGQ. VMGFPGP/ 

KGA 

1173.58 

2 

587.29 

KTGPPGP/ 

AGQ...GARGQAG/  VMG 

1799.89 

2 

900.45 

481  glpgakg^ 
541  tgspg^pg^d  gktgppg^ag  gdgrpgppgp  gfpg^t^^ag  epgl^^^e^v 
601  pgppgavg^  gkdgeagaqg  ppgp^gpage  r 


Figure  3:  (A)  FAP  cleavage  sites  within  the  8.5  kDa  fragment  of 
recombinant  human  collagen  I  derived  gelatin.  Cleavage  sites  are 
bracketed  by  and  [M  +  Fl]+  denotes  the  mass  of  the  cleavage 
fragment.  Amino  acids  on  the  N-terminal  side  of  the  cleavage  site 
(i.e.,  P7— Pi)  are  in  bold  font.  (B)  Amino  acid  sequence  of  the  8.5 
kDa  fragment  with  FAP  cleavage  sites  indicated  by  arrows. 


Figure  2:  (A)  MALDI-TOF  mass  spectra  demonstrating  FAP 
digestion  of  human  collagen  I.  (B)  Example  of  cleavage  site 
sequence(s)  determination  using  the  FindPept  tool  for  a  selected 
mass  of  21 14.5  from  MALDI-TOF  mass  spectra.  Multiple  potential 
cleavage  site  sequences  within  ±1  mass  unit  are  identified  by  this 
analysis. 

identify  the  cleavage  sequences  using  the  FindPept  tool 
(Figure  2B).  In  most  cases,  multiple  peptide  sequences  were 
matched  for  the  same  mass,  and  in  some  instances,  more 
than  30  sequences  were  obtained  for  one  particular  mass. 
This  result  was  most  likely  due  to  the  fact  that  human 
collagen  I  is  a  heterotrimeric  polymer  made  up  of  repeating 
sequences  containing  the  (GXY)„  motif  (where  X  =  Pro  and 
Y  =  hydroxy-Pro).  Human  collagen  I  is  also  known  to  be 
glycosylated  and  cross-linked  randomly  throughout  its 
sequence  (27).  These  post-translational  modifications  in 
human  collagen  have  not  been  well  characterized  and, 
therefore,  make  it  more  difficult  to  determine  the  exact 
cleavage  sites  using  MALDI-TOF  coupled  with  other  pro- 
teomics  tools.  These  results,  while  demonstrating  that  FAP 
cleaved  denatured  human  collagen  I,  demonstrated  the 
difficulties  in  obtaining  correct  cleavage  sequences  by  mass 
spectrometry  due  to  the  polymeric  nature  and  poorly  defined 
qualities  of  human  collagen  I. 

LC  and  Tandem  MS/MS  Analysis  of  FAP  Digestion  of 
Recombinant  Gelatin.  As  collagen  I  derived  gelatin  is 
currently  one  of  the  only  known  protein  substrates  for  FAP, 
we  needed  to  develop  an  alternative  method  to  identify  FAP- 
selective  cleavage  sites  within  these  proteins.  To  solve  the 
problem  of  post-translational  modification,  we  identified  a 
source  of  recombinant  human  gelatin  and  collagen  from 
FibroGen  (South  San  Francisco,  CA)  that  was  prepared  by 
cloning  the  human  collagen  I  sequence  in  a  strain  of  Pichia 
pastoris  which  lacks  the  enzyme  prolyl  hydroxylase  (22). 
This  human  collagen  I  based  gelatin  is  well  characterized 
and  has  no  post-translational  modifications.  This  gelatin  is 
used  to  produce  drug  capsules  and  vaccine  adjuvants. 
Therefore,  this  recombinant  gelatin  was  used  in  subsequent 
FAP  cleavage  mapping  studies. 

We  next  used  MALDI-TOF  to  analyze  FAP  digestion  of 
the  100  kDa  recombinant  form  of  human  gelatin.  SDS— 


PAGE  separation  and  MALDI-TOF  spectra  showed  that  FAP 
readily  digests  the  recombinant  gelatin  (data  not  shown).  The 
masses  of  fragments  <3  kDa  were  purified  for  MALDI-TOF 
spectra  and  again  analyzed  using  the  FindPept  tool.  However, 
once  again  multiple  peptide  sequences  were  obtained  for  each 
cleavage  fragment. 

Therefore,  to  resolve  each  particular  mass  fragment,  an 
LC/MS/MS  method  was  developed.  To  initially  work  out 
the  methodological  issues,  peptide  fragments  from  the  FAP 
digest  of  the  8.5  kDa  recombinant  gelatin  were  time-resolved 
by  nano-reversed-phase  LC  and  then  sequenced  using  an  ion 
trap  mass  spectrometer  operating  in  MS/MS  mode  (Figure 
3A).  MS/MS  spectra  were  searched  against  the  8.5  kDa 
gelatin  sequences  using  the  SEQUEST  algorithm  with  no 
cleavage  specificity.  In  this  analysis  most  of  the  identified 
cleavage  sites  occurred  after  proline.  However,  EAP  cleavage 
sites  in  these  gelatins  were  not  restricted  to  proline  alone. 
EAP  was  also  found  to  cleave  after  Ala,  Arg,  Gly,  Lys,  and 
Ser.  PAP  was  also  able  to  cleave  after  multiple  adjacent 
amino  acids  within  the  8.5  kDa  gelatin  fragment  (Pigure  3B). 

FAP  Cleavage  Map  for  Full-Length  100  kDa  Gelatin.  On 
the  basis  of  the  results  with  the  8.5  kDa  gelatin  fragment, 
we  proceeded  to  use  this  LC/MS  method  to  identify  all  of 
the  PAP  cleavage  sites  within  the  full-length  100  kDa 
recombinant  gelatin.  In  an  effort  to  identify  the  most 
preferentially  cleaved  sites,  we  also  quantified  the  relative 
abundance  of  each  of  the  identified  peptides  by  integrating 
the  ion  current  generated  by  each  peptide  throughout  the 
chromatogram  (Pigure  4).  After  identification  of  a  peptide 
from  an  MS/MS  spectrum,  we  extracted  the  ion  current  for 
the  parent  mass  of  the  ion  from  the  total  MS  ion  chromato¬ 
gram  using  a  1.5  Da  tolerance  window  and  then  integrated 
under  the  peak.  In  the  case  of  multiple  peaks,  we  chose  the 
peak  nearest  to  the  retention  time  of  the  MS/MS  spectrum 
matched  to  the  peptide  sequence  of  interest. 

This  analysis  revealed  a  total  of  51  PAP  cleavage 
fragments  within  the  100  kDa  recombinant  gelatin  (Pigure 
4).  The  corresponding  fragment  was  mapped  within  the 
recombinant  gelatin  to  determine  the  sequence  of  the  amino 
acids  upstream  of  the  PAP  cleavage  site  toward  the  N- 
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[M+H]+ 

Charge 

m/z 

(observed) 

Extracted 

Ion  Current 

RTGDAGP/ 

VGP...PGPPGP 

1373.72 

1 

1373.72 

1.48E+11 

ASGPAGP/ 

RGP...LNGLPGP/  IGP 

1871.97 

2 

936.49 

4.94E+10 

DRGETGP/ 

AGP...APGAPGP/  VGP 

1 170.59 

1 

1170.59 

2.34E+10 

DRGETGP/ 

AGP... DRGETGP/  AGP 

2536.25 

2 

1268.63 

2.15E+10 

EPGPPGP/ 

AGF...DAGPPGP/  AGP 

2828.32 

3 

943.44 

1.52E+10 

RTGDAGP/ 

VGP...PGPPGPP/  GPP 

871.47 

1 

871.47 

1.47E+10 

GETGPAG/ 

PPG... DRGETGP/  AGP 

2408.20 

3 

803.40 

1.45E+10 

OPSGPAG/ 

PTG... EPGPPGP/  AGF 

1742.86 

2 

871.93 

1.31E+10 

PSGPAGP/ 

TGA... EPGPPGP/  AGF 

1645.81 

2 

823.40 

1.21E+10 

PAGAAGP/ 

AGN...FPGARGP/  SGP 

2812.39 

2 

1406.69 

1.07E+10 

PAGPPGA/ 

PGA... DRGETGP/  AGP 

2086.03 

2 

1043.52 

9.34E+09 

FQGLPGP/ 

AGP...DLGAPGP/  SGA 

2112.04 

2 

1056.52 

9.26E+09 

MGFPGP/ 

KGA... PPGAVGP/  AGK 

2113.12 

2 

1057.06 

7.40E+09 

PPGPAGP/ 

AGP... RVGPPGP/  SGN 

3556.85 

3 

1186.28 

6.68E+09 

RVGPPGP/ 

SGN... PPGPPGP/  AGK 

1004.48 

1 

1004.48 

5.99E+09 

AGRVGPP/ 

GPS...AGPPGPP/  GPA 

1004.48 

1 

1004.48 

5.85E+09 

PPGAPGP/ 

QGF...EPGASGP/  MGP 

1665.75 

1 

1665.75 

5.76E+09 

ETGPAGP/ 

PGA... DRGETGP/  AGP 

2311.14 

2 

1156.07 

4.82E+09 

PPGAPGA/ 

PGA... DRGETGP/  AGP 

1860.92 

2 

930.96 

4.39E+09 

GLTGPIG/ 

PPG...DKGESGP/  SGP 

1390.66 

2 

695.83 

4.22E+09 

DRGETGP/ 

AGP...APGPVGP/  AGK 

1423.73 

1 

1423.73 

3.85E+09 

ESGPSGP/ 

AGP... EPGPPGP/  AGF 

1870.92 

2 

935.96 

3.34E+09 

PRGETGP/ 

AGR...PPGPPGP/  AGE 

1341.69 

2 

671.35 

3.31E+09 

VRGLTGP/ 

IGP...  DKGESGP/  SGP 

1560.77 

1 

1560.77 

2.79E+09 

PPGPTGP/ 

AGP...AKGEAGP/  QGP 

1536.78 

2 

768.89 

2.61E+09 

FPGLPGP/ 

SGE...ERGPPGP/  MGP 

1905.91 

2 

953.45 

2.18E+09 

PPGPPGP/ 

AGE...APGTPGP/  QGI 

1747.83 

2 

874.41 

2.15E+09 

LTGPIGP/ 

PGP... DKGESGP/  SGP 

1293.61 

1 

1293.61 

2.11E+09 

AKGDAGP/ 

AGP...GENGAPG/  OMG 

1478.69 

1 

1478.69 

1.48E+09 

VMGFPGP/ 

KGA...AQGPPGP/  AGP 

3402.72 

3 

1134.91 

1.44E+09 

PPGPAGA/ 

PG-  DKGESGP/  SGP 

843.39 

1 

843.39 

1.14E+09 

PPGPMGP/ 

PGL...EGSPGRD/  GSP 

2275.07 

2 

1138.04 

1.06E+09 

GFPGLPG/ 

PSG...ERGPPGP/  MGP 

2002.96 

2 

1001.98 

l.OlE+09 

PPGPTGP/ 

AGP... EPGPPGP/  AGA 

3530.75 

3 

1177.58 

9.13E+08 

APGAPGA/ 

PGP... DRGETGP/  AGP 

1635.81 

2 

818.40 

8.35E+08 

PRGSEGP/ 

QGV... EPGPPGP/  AGA 

1147.59 

1 

1147.59 

8.14E+08 

TGDAGPV/ 

GP-  PGPPGPP/  GPP 

772.40 

1 

772.40 

7.43E+08 

GPAGFAG/ 

PPG...  DAGPPGP/  AGP 

2425.14 

3 

809.05 

6.91E+08 

AKGEPGP/ 

VG-  VQGPPGP/  AGE 

807.44 

1 

807.44 

6.87E+08 

AGPPGAD/ 

GQP... DAGPPGP/  AGP 

1987.95 

2 

994.47 

5.43E+08 

LPGPSGE/ 

PGK...ERGPPGP/  MGP 

1632.81 

2 

816.90 

3.83E+08 

GLTGPIG/ 

PPG...AGAPGDK/  GES 

963.49 

2 

482.25 

3.01E+08 

KTGPPGP/ 

AGO..GARGOAG/  VMG 

1799.89 

2 

900.45 

2.10E+08 

GAKGDAG/ 

PPG... PAG  PPGP/  IGN 

1068.55 

1 

1068.55 

1.59E+08 

RGETGPA/ 

GPP...APGAPGP/  VGP 

1099.55 

1 

1099.55 

1.31E+08 

ATGFPGA/ 

AG-  RVGPPGP/  SGN 

807.45 

1 

807.45 

1.03E+08 

PGPAGOD/ 

GRP...GARGOAG/  VMG 

1428.75 

2 

714.87 

7.53E+07 

SGPRGLP/ 

G-  PPGAPGP/  QGF 

649.33 

1 

649.33 

7.18E+07 

PPGAVGP/ 

AGK  AQGPPGP/  AGP 

1308.62 

2 

654.81 

5.98E+07 

AGPPGPA/ 

G-  PAGPPGP/  IGN 

649.33 

1 

649.33 

4.23E+07 

PSGPQGP/ 

GGP...APGSKGD/  TGA 

1819.86 

2 

910.43 

3.58E+07 

Figure  4:  FAP  cleavage  sites  within  the  100  kDa  recombinant  human  collagen  I  derived  gelatin.  Cleavage  sites  are  bracketed  by  The 
mass  of  the  cleavage  fragment,  [M  +  H]+,  and  extracted  ion  current  are  included  for  each  cleavage  site.  Underlined  sequences  indicate 
cleavage  sites  which  do  not  contain  Pro  (P)  in  the  PI  position.  The  consensus  sequences  PPGP  and  D-(R/K)-G-E-(T/S)-G-P  are  indicated 
by  bold  text. 


terminus.  Since  each  fragment  was  produced  by  two  cleavage 
events,  101  peptide  sequences  corresponding  to  the  amino 
acids  from  P7  to  P'l  could  be  mapped.  Cleavage  fragments 
were  then  ranked  on  the  basis  of  the  extracted  ion  current 
generated  by  each  peptide  (Figure  4).  This  ranking  again 
revealed  that  FAP  preferred  to  cleave  after  the  G-P  dipeptide. 
FAP  was  also  able  to  cleave  after  other  amino  acids  (i.e., 
Ala,  Asp,  Gly,  Glu,  Lys,  and  Val),  but  the  majority  of  these 
cleavage  fragments  had  very  low  extracted  ion  currents  and 
were  ranked  in  the  lower  half  of  the  list  (Figure  4).  Often 
these  non-proline  cleavage  sites  were  adjacent  to  a  proline 
cleavage  site.  This  result  was  also  observed  with  the  8.5  kDa 
digest  and  suggested  that  the  gelatin  protein  may  misalign 
within  the  FAP  catalytic  site,  resulting  in  low-level  cleavage 
after  non-proline  amino  acids. 


On  the  basis  of  these  cleavage  sites,  the  frequency  of 
individual  amino  acids  within  the  P7— P'l  positions  of  each 
cleavage  site  was  next  determined.  This  analysis  was 
somewhat  limited  by  the  repetitive  nature  of  the  collagen  I 
sequence,  and  it  was,  therefore,  not  necessarily  unexpected 
that  the  most  frequent  amino  acids  at  each  position  were 
either  proline  or  glycine,  producing  a  consensus  peptide  with 
the  sequence  PPGPPGP.  This  exact  sequence  is  found  in  3 
of  the  sequences  within  the  map,  while  the  peptide  PPGP  is 
found  in  21  of  the  101  sequences. 

The  frequency  analysis,  however,  did  reveal  some  ad¬ 
ditional  trends  that  could  bear  on  the  identification  of 
consensus  sequences  to  be  used  in  the  development  of  FAP- 
activated  peptide  prodrugs  (Figure  5).  After  proline  in  P7, 
for  instance,  the  acidic  amino  acids  Asp  (23%)  and  Glu 
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Figure  5:  Frequency  (%)  of  each  amino  acid  in  positions  P7— P'l  within  the  FAP  cleavage  sites  within  the  100  kDa  recombinant  human 
collagen  I  derived  gelatin.  Gray  bars  indicate  the  frequency  for  all  101  cleavage  sites.  Black  bars  indicate  the  frequency  in  only  those 
cleavage  sites  containing  Pro  (P)  at  the  PI  position.  The  amino  acids  C,  H,  W,  and  Y  are  not  present  in  human  collagen  I  and,  therefore, 
are  not  included. 


Table  1:  Frequency  Analysis  of  Consensus  Sequenees  within  the  100  kDa  Gelatin  Amino  Acid  Sequence 


amino 

acids 

total  no.  of 

G-P  cleavage 
sites”  (%) 

no.  of 

FAP-cleaved 
sites*  (%) 

P7 

P6 

P5 

P4 

P3 

P2 

PI 

PI' 

G 

P 

116(100) 

40  (34) 

G 

P 

A 

28  (24) 

20  (71) 

D/E 

K/R 

G 

D/E 

A/G/S/T 

G 

P 

4(3) 

3(75) 

A/D/E/P 

K/R 

G 

D/E/P 

A/G/S/T/P 

G 

P 

14(12) 

8(57) 

D/E 

K/R/P 

G 

D/E/P 

A/G/S/T/P 

G 

P 

8(7) 

6(75) 

combined'^ 

15(13) 

9(60) 

D/E/P 

K/R/P 

G 

D/E/P 

A/G/S/T/P 

G 

non-Pro 

7(6) 

0(0) 

"  Occurrence  of  the  G-P  dipeptide  sequence  in  the  100  kDa  gelatin  sequence.  ''  Number  of  times  FAP  cleaves  after  G-P  sites.  '  Total  unique  sites 
from  the  consensus  sequence  indicated  in  rows  3—5. 


(15%)  were  found  in  ~38%  of  the  sequences.  In  P6,  30% 
of  the  sequences  contained  one  of  the  basic  amino  acids  Lys 
(10%)  and  Arg  (20%).  In  P4,  Glu  was  observed  in  20%  of 
the  sequences  and  Asp  in  4%.  In  P3,  the  small  polar  amino 
acids  Ser  and  Thr,  which  both  contain  a  -OH  functional 
group,  were  found  in  24%  of  the  sequences  and  Ala  and 
Gly  with  a  small  or  no  side  chain  were  found  in  26%.  Finally, 
in  the  P'l  position,  Ala  was  observed  in  almost  half  of  the 
sequences  (Figure  5).  On  the  basis  of  this  analysis,  a  second 
consensus  sequence  emerged  [i.e.,  (D/E)-(R/K)-G-(E/D)-(A/ 
G/T/S)-G-P-A  or  (acidic  AA)-(basic  AA)-G-(acidic  AA)- 
(small/polar  OH  AA)-G-P-A)].  This  consensus  sequence  is 
found  in  15  of  the  101  peptide  sequences  within  the  cleavage 
map  (Eigure  4). 


To  determine  whether  these  consensus  sequence  motifs 
represented  EAP-preferred  cleavage  sites  or  instead  were 
merely  observed  due  to  a  relative  abundance  of  these 
particular  sequences  within  the  overall  gelatin  amino  acid 
sequence,  we  evaluated  the  frequency  of  occurrence  of  these 
sites  within  the  gelatin  protein  (Table  1).  Overall  there  were 
66  EAP  cleavage  sites  within  the  100  kDa  recombinant 
gelatin.  A  total  of  1 16  G-P  sites  are  found  within  the  protein, 
and  40  (34%)  of  these  were  cleaved  by  PAP,  Table  1.  To 
emphasize  the  importance  of  the  Ala  in  the  P'l  position,  28 
sites  with  the  tripeptide  G-P-A  exist  in  the  gelatin  sequence 
(24%  of  total  G-P-containing  sites).  However,  when  present, 
71%  of  these  sites  were  cleaved  by  PAP.  In  addition,  on  the 
basis  of  the  cleavage  site  analysis,  we  evaluated  the  frequency 
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Table  2:  Enzyme  Kinetics  of  FAP  Substrates  Generated  from  Cleavage  Sites  within  100  kDa  Gelatin 

no. 

substrate  sequence 

KmCttM) 

Vm  (mol/s) 

^cat(s  ') 

normalized 
ion  current  rank 

1 

Mca-ASGPAGPA-Dnp 

36.1  ±2.7 

2.26E-12±8.14E-14 

0.196  ±0.007 

5441  ±478 

2 

2 

Mca-EPGPPGPA-Dnp 

26.9  ±  0.7 

1.32E-12±9.53E-14 

0.115  ±0.006 

4270  ±  90 

5 

3 

Mca-DRGETGPA-Dnp 

21.0  ±2.2 

8.50E-13  ±  1.89E-14 

0.081  ±  0.002 

3851  ±366 

3,4,7 

4 

Mca-VGPAGK-Dnp 

52.0  ±  4.4 

1.09E-12±4.41E-14 

0.095  ±  0.004 

1822  ±  100 

13,21 

5 

Mca-DKGESGPA-Dnp 

46.0  ±  3.5 

7.69E-13±2.11E-14 

0.067  ±  0.001 

1453  ±  99 

20,  24 

6 

Mca-APGSKGDA-Dnp 

51.0±  11.8 

1.68E-13  ±  1.95E-13 

0.015  ±0.017 

286  ±  391 

51 

Table  3:  Enzyme  Kinetics  of  FAP  Substrates  Derived  from  the  Consensus  Sequence  DRGETGPA 

no. 

substrate  sequence 

Km  (pM) 

Vm  (mol/s) 

^cat  (s  ) 

kcJKm  (M-‘  s-‘) 

7 

Mca-DRGETGP-Dnp 

23.8  ±  6.8 

3.49E-14±8.02E-15 

0.003  ±  0.001 

127  ±31.9 

8 

Mca-DRGETGPA-Dnp 

21.0  ±2.2 

8.50E-13  ±  1.89E-14 

0.081  ±  0.002 

3851  ±  366 

9 

Mca-ERGETGPA-Dnp 

56.7  ±  0.7 

1.62E-12±5.20E-14 

0.141  ±0.004 

2483  ±  58 

10 

Mca-ERGETGPAG-Dnp 

43.0  ±  3.8 

1.39E-12±8.41E-14 

0.121  ±0.007 

2810  ±  149 

11 

Mca-ERGETGPAGG  -Dnp 

37.1  ±  1.1 

9.05E-13±2.69E-12 

0.079  ±  0.004 

2120  ±35 

12 

Mca-DSGETGP-Dnp 

39.6  ±  29.7 

3.88E-14±8.11E-15 

0.003  ±  0.001 

85  ±60 

13 

Mca-DSGETGPA-Dnp 

20.2  ±  0.5 

6.74E-13  ±4.65E-14 

0.059  ±  0.004 

2900  ±  221 

14 

Mca-ERGETGPSG-Dnp 

51.3  ±  1.9 

3.08E-12±  1.77E-13 

0.268  ±  0.015 

5218  ±316 

of  the  consensus  sequence  (D/E)-(K/R)-(D/E)-G-(A/G/S/T)- 
G-P,  the  expanded  consensus  sequence  (A/D/E/P)-(K/R)-(D/ 
E/P)-G-(A/G/S/T/P)-G-P,  in  which  Pro  is  also  included  in 
the  analysis  and  the  P6  position  only  included  the  basic 
amino  acids  K/R,  and  the  expanded  consensus  sequence  (D/ 
E)-(K/R/P)-(D/E/P)-G-(A/G/S/T/P)-G-P,  in  which  the  P7 
position  included  only  the  acidic  amino  acids  D/E  and  not 
Pro  or  Ala.  The  (D/E)-(K/R)-(D/E)-G-(A/G/S/T)-G-P  se¬ 
quence  is  cleaved  by  PAP  three  out  of  the  four  times  it 
appears  in  the  gelatin  sequence,  Table  1.  Overall  unique 
consensus  sequences  were  found  in  only  13%  of  the  total 
G-P  cleavage  sites  in  gelatin.  However,  when  present,  60% 
of  these  sequences  were  cleaved  by  PAP,  Table  1.  The 
presence  of  this  sequence,  however,  was  not  sufficient  to 
produce  PAP  cleavage  in  the  absence  of  proline  in  the  PI 
amino  acid,  thus  further  confirming  the  importance  of  proline 
in  this  position.  Seven  such  sequences  containing  (A/D/E/ 
P)-(K/R/P)-(D/E/P)-G-(A/G/S/T/P)-G-(non-proline)  were  iden¬ 
tified  in  the  gelatin  sequence,  and  none  of  these  were  cleaved 
by  PAP  (Table  1).  These  results  demonstrate  that  the  frequent 
appearance  of  the  (D/E)-(K/R)-(D/E)-G-(A/G/S/T)-G-P  se¬ 
quence  in  the  cleavage  fragments  is  not  due  to  the  frequent 
occurrence  of  this  motif  in  the  overall  gelatin  sequence,  but 
is  rather  due  to  PAP  preference  for  cleavage  after  this 
consensus  site. 

FAP  Hydrolysis  of  Fluorescence-Quenched  Peptides  Based 
on  the  Gelatin  Cleavage  Map.  While  previous  studies  had 
already  determined  that  PAP  preferred  to  cleave  after  the 
dipeptide  GP,  the  normalized  ion  current  analysis  also 
suggested  a  strong  preference  for  cleavage  after  GP  as  well 
as  a  strong  preference  for  cleavage  of  the  peptide  with  GP 
in  P2— PI  and  Ala  in  P'l.  To  determine  whether  this  type  of 
cleavage  site  ranking  based  on  the  abundance  of  each 
extracted  ion  in  the  MS  spectra  would  help  predict  which 
sequences  represented  good  PAP  substrates,  we  proceeded 
to  synthesize  a  series  of  fluorescently  quenched  peptide 
substrates  based  on  the  cleavage  sequences.  Por  this  analysis, 
we  generated  peptide  substrate  from  sequences  with  high 
(i.e.,  ASGPAGPA,  DRGETGPA),  intermediate  (i.e.,  EKGES- 
GP,  VGPAGK),  and  low  (i.e.,  GARGQAG,  APGSKGDA) 
extracted  ion  currents,  Pigure  4.  Michaelis— Menten  kinetic 
parameters  (i.e.,  K^,  Vmax,  ^cat)  of  PAP  hydrolysis  of  these 


fluorescence-quenched  peptides  were  determined.  kcJKm 
ratios  were  calculated  to  rank  these  peptides  as  PAP 
substrates  (Table  2). 

The  Mca-ASGPAGPA-Dnp  substrate  (1)  had  the  highest 
ratio  (Table  2).  This  peptide  sequence  was  also  based 
on  a  cleavage  fragment  with  the  second  highest  extracted 
ion  current.  Substrate  2,  based  on  the  fragment  with  the  fifth 
ranked  extracted  ion  current,  had  the  second  highest  kc-JK^ 
ratio.  In  addition,  the  Mca-DRGETGPA-Dnp  substrate  (3) 
had  the  third  highest  ratio.  This  substrate  is  based  on  a 
consensus  sequence  that  is  the  second  most  common  cleavage 
site  in  the  map  and  which  is  found  in  cleavage  fragments 
with  the  third,  fourth,  and  seventh  highest  extracted  ion 
currents.  In  contrast,  the  Mca-APGSKGDA-Dnp  substrate 
(6),  which  was  least  efficiently  cleaved  by  PAP,  was  based 
on  the  cleavage  fragment  with  the  lowest  overall  extracted 
ion  current  (Pigure  4).  Pormal  kinetic  studies  were  not 
performed  on  a  second  substrate,  Mca-GARGQAG-Dnp, 
based  on  the  fragment  with  the  47th  ranked  extracted  ion 
current,  but  this  substrate  demonstrated  a  low  hydrolysis  rate 
compared  to  rates  from  an  equimolar  concentration  (i.e.,  200 
/tM)  of  substrates  1—5.  Two  other  substrates  with  intermedi¬ 
ate  KcJKjn  ratios  [i.e.,  Mca-VGPAGK-Dnp  (4)  and  Mca- 
DKGESGPA-Dnp  (5)]  were  found  in  cleavage  fragments 
with  intermediate  extracted  ion  currents  (Table  2). 

In  a  final  set  of  studies,  we  synthesized  additional 
fluorescence-quenched  peptides  to  explore  the  importance 
of  amino  acids  in  the  P'  positions  on  the  carboxy  terminal 
side  of  the  cleavage  site  (Table  3).  A  comparison  of  the  K^  J 
Km  ratios  of  substrate  7  with  no  P'l  amino  acid  to  8  and 
substrate  12  to  13  demonstrated  that  PAP  prefers  an  amino 
acid  in  the  P'l  position.  Both  substrates  lacking  the  P'l 
alanine  were  poorly  hydrolyzed  by  PAP.  The  addition  of  a 
P'2  Gly  as  in  10  or  a  P'3  Gly  as  in  substrate  11  did  not 
substantially  improve  the  kinetic  ratio.  Other  comparisons 
demonstrated  a  slight  preference  for  Asp  in  the  P7  position 
as  in  substrate  8  compared  to  Glu  as  in  substrate  9.  In  the 
P6  position,  substitution  of  Ser  as  in  substrate  13  for  Arg  as 
in  substrate  8  resulted  in  some  decrease  in  the  kinetic  ratio. 
Pinally,  in  the  P'l  position,  while  Ser  was  only  observed  in 
12%  of  the  sequences  compared  to  47%  for  Ala,  the 
substitution  of  Ser  in  the  P'l  position  as  in  substrate  14 
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resulted  in  a  substantial  increase  in  the  k^  JK^  ratio  compared 
to  that  of  the  comparable  substrate  10  containing  Ala  in  this 
position  (Table  3).  The  substitution  of  Ser  in  the  P'l  position 
generated  a  peptide  substrate  with  a  kcat  that  was  2—5  fold 
higher  than  that  of  other  active  substrates. 

DISCUSSION 

The  overall  goal  of  this  study  was  to  develop  a  method  to 
identify  substrates  on  the  basis  of  FAP’s  collagenase  or 
gelatinase  activity.  This  knowledge  of  substrate  specificity 
can  be  used  to  elucidate  FAP’ s  biological  role  as  well  as  for 
development  of  targeted  therapeutic  prodrugs.  Substrate 
specificity  can  be  defined  by  using  either  by  high-throughput 
methods  such  as  the  positional  scanning  synthetic  combi¬ 
natorial  library  (75)  and  one-bead  one-peptide  library  (23) 
or  phage  display  (24,  25).  However,  these  methods  have  the 
disadvantage  of  an  artificial  scaffold  which  can  alter  the 
physiological  substrate  specificity  of  a  protease.  Here  we 
have  described  an  approach  to  take  a  known  protein  substrate 
for  a  protease  and  map  its  cleavage  site  using  proteomics. 
In  the  case  of  FAP,  the  problem  was  more  complicated 
because  of  the  polymeric  nature  of  the  collagen  I  protein 
coupled  with  multiple  types  of  post-translational  modifica¬ 
tions. 

A  large  number  of  collagenases  and  gelatinases  have  been 
previously  reported  in  the  literature.  However,  for  most  of 
them  substrate  characterization  was  done  using  combinatorial 
libraries  or  synthetic  model  substrates  (24,  26—29).  To  our 
knowledge  there  is  only  one  prior  reported  study  in  which 
the  collagen  protein  was  digested  with  a  new  type  of 
collagenase  and  cleavage  sites  determined  by  Edman  se¬ 
quencing  of  specific  bands  isolated  on  SDS— PAGE  (30). 
However,  in  this  previous  study  only  10—12  cleavage 
fragments  were  obtained.  Thus,  it  was  possible  to  easily 
separate  them  by  SDS— PAGE  (30).  In  contrast,  in  our  study 
the  EAP  digest  of  collagen  or  gelatin  produced  more  than 
100  fragments,  which  produced  a  smear  on  SDS— PAGE, 
rather  than  discrete  bands.  This  result  suggested  that  EAP 
cleaved  collagen  at  many  sites,  thus  making  traditional 
Edman  sequencing  impractical.  On  this  basis,  we  proceeded 
to  develop  the  LC/MS/MS -based  approach  to  map  these  EAP 
cleavage  sites. 

Denatured  human  collagen  I  was  digested  with  PAP,  and 
small  size  fragments  (<3  kDa)  were  purified  for  MALDI- 
TOP  analysis.  Previously,  MADLI-TOP  has  been  used  to 
identify  proteins  by  analysis  of  tryptic  digests  (31).  Mass 
spectra  revealed  that  PAP  was  cleaving  collagen  I  at  specific 
sites.  However,  sites  could  not  be  identified  by  MALDI- 
TOP  because  each  mass  peak  frequently  matched  more  than 
10  individual  sequences  within  the  collagen  I  protein.  An 
attempt  was  also  made  to  perform  an  LC/tandem  MS/MS 
analysis  on  these  fragments.  However,  the  mass  spectroscopic 
data  could  not  be  solved  because  of  collagen’s  several  post- 
translational  modifications  (PTMs)  that  include  proline 
hydroxylation,  glycosylation,  and  cross-linking.  Many  of 
these  PTMs  have  not  been  characterized  within  collagen  I, 
and  therefore,  we  determined  that  nonrecombinant,  purified 
human  collagen  I  could  not  be  easily  used  for  such  an 
analysis. 

To  solve  these  problems,  we  identified  a  source  of 
recombinant  gelatin  derived  from  human  collagen  I  that  had 
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no  PTMs  and  which  could  be  used  for  PAP  cleavage 
mapping  (22).  After  validating  the  LC/MS  based  method 
using  the  8.5  kDa  gelatin  fragment,  we  proceeded  to  generate 
the  map  of  all  PAP  cleavage  sites  within  full-length  100  kDa 
human  collagen  I  derived  gelatin.  Sequence  analysis  of  the 
51  peptide  fragments  from  this  digest  confirmed  earlier 
reported  studies  documenting  that  PAP  has  a  preference  for 
the  dipeptide  GP  in  the  P2— PI  position.  However,  these 
earlier  studies,  based  on  a  positional  scanning  method, 
reported  a  strict  requirement  for  G-P  for  PAP  hydrolysis.  In 
contrast  to  these  results,  we  documented  that,  while  PAP 
prefers  cleavage  after  the  G-P  dipeptide,  PAP  can  also  cleave 
peptides  containing  other  amino  acids  in  the  PI  position  (i.e., 
Ala,  Arg,  Asp,  Gly,  Glu,  Lys,  Ser,  and  Val). 

One  disadvantage  of  the  positional  scanning  synthetic 
approach  is  that  it  often  does  not  yield  much  specific 
sequence  information  beyond  the  P2— P3  position  of  the 
peptide.  Por  the  case  of  PAP,  Edosada  et  al.,  using  positional 
scanning,  documented  a  preference  for  Ser  or  Ala  in  the  P3 
position  (32).  This  finding  also  emerged  from  our  cleavage 
map  analysis,  in  which  Ser/Thr  was  observed  in  24%  of  the 
sequences  in  P3  and  Ala  was  observed  in  1 1%.  In  the  study 
of  Edosada  et  al.,  no  preference  could  be  observed  in  P4 
(32).  Using  a  map  of  a  known  protein  substrate,  in  this  case, 
allowed  us  to  obtain  additional  sequence  information  beyond 
P3,  with  sequence  preferences  emerging  for  P4,  P6,  and  P7 
from  the  analysis  of  the  cleavage  map. 

An  added  advantage  of  this  LC/MS/MS  sequencing 
approach  is  that  it  allowed  us  to  quantify  the  abundance  of 
each  identified  peptide  by  comparing  the  extracted  ion  current 
for  each  of  the  observed  peptide  parent  ions  from  the  MS 
spectra.  The  contribution  of  each  parent  ion  to  the  total  ion 
current  could  then  be  extracted  and  integrated  over  the 
peptide  elution  peak.  This  approach  allowed  us  to  rank  the 
peptide  fragments  on  the  basis  of  the  extracted  ion  current. 
Traditional  Edman  sequencing  does  not  provide  the  potential 
for  such  fragment  stratification.  While  it  was  already  known 
before  this  study  that  PAP  preferred  cleavage  after  proline, 
this  information  could  also  have  been  learned  initially  from 
analysis  of  the  cleavage  fragments  ranked  by  the  extracted 
ion  current. 

Since  the  abundance  of  each  individual  fragment  depends 
on  cleavage  from  two  sites,  the  extracted  ion  current  for  each 
fragment  represents  an  “average”  based  on  the  efficiency  of 
PAP  cleavage  at  both  sites.  Therefore,  initially,  we  thought 
the  ranking  would  provide  no  additional  information  beyond 
the  prediction  of  a  preferred  amino  acid  in  the  PI  position. 
To  evaluate  the  utility  of  this  ranking  based  on  ion 
abundance,  we  synthesized  seven  fluorescently  quenched 
peptide  substrates  based  on  peptide  sequences  from  fragments 
with  high,  intermediate,  and  low  extracted  ion  currents  and 
determined  PAP  hydrolysis  kinetics  for  each.  The  extracted 
ion  current  rankings  corresponded  well  with  the  k^  JK^  ratios 
of  these  peptide  substrates,  with  the  highest  ranked  having 
the  highest  ratio,  etc.  Although  based  on  a  limited  sample 
of  peptide  substrates,  these  results  suggest  that  this  method 
of  ranking  cleavage  sites  could  be  applied  to  the  analysis  of 
cleavage  maps  of  other  proteases  with  unknown  substrate 
specificity  to  help  identify  the  best  substrates  for  further 
synthesis  and  evaluation. 

In  conclusion,  we  have  used  an  LC/MS/MS -based  method 
to  generate  a  map  of  all  PAP  cleavage  sites  within  recom- 
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binant  human  collagen  I  derived  gelatin.  From  this  map,  we 
identified  potential  consensus  peptide  sequences  that  could 
be  used  to  generate  FAP-selective  peptide— cytotoxin  pro¬ 
drugs.  These  FAP-activated  prodrugs  would  selectively  target 
FAP-producing  myofibroblasts  within  the  stroma  of  epithelial 
tumors  and,  therefore,  would  represent  a  potential  “pan¬ 
tumor”  therapeutic  agent.  Further  studies  are  under  way  in 
our  laboratory  to  assess  the  efficiency  and  specificity  of  these 
consensus  peptides  for  FAP  hydrolysis  compared  to  those 
of  other  proteases.  In  addition,  we  are  further  evaluating  the 
role  of  proline  hydroxylation  of  some  of  these  substrates  to 
determine  whether  such  modification  enhances  FAP  hy¬ 
drolysis. 
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